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Abstract. In this paper, an event network is presented for exploring open in¬ 
formation, where linguistic units about an event are organized for analysing. 
The process is divided into three steps: document event detection, event network 
construction and event network analysis. First, by implementing event detection 
or tracking, documents are retrospectively (or on-line) organized into document 
events. Secondly, for each of the document event, linguistic units are extracted 
and combined into event networks. Thirdly, various analytic methods are pro¬ 
posed for event network analysis. In our application methodologies are presented 
for exploring open information. 


1 Introduction 

Exploring information in open fields is a crucial and challenging task for human beings 
in the age characterized by flooding of information. Gigantic volumes of data spring 
up all over the world. This information is availably for us by connecting into the In¬ 
ternet. Events (e.g. the Ebola virus, the Islamic state) outbroken far away from us may 
affect our ordinary life. We no longer turn a blind eye to world events, that we are get¬ 
ting involved. We not only consume information, but also produce it. Social media (e.g., 
Twitter) makes us easily and freely express opinions. Millions of comments spread over 
the Internet, affecting the trend of public opinions. Accessing this information is bene¬ 
ficial for our decision-makings. Commonly, Information Retrieval (IR) and Information 
Extraction (IE) provide helpful solutions for us to handle the surge of information. 

When exploring information, event oriented techniques provide effective approaches 
to understand who, where, when and what happened. Output of event detection in IR is 
series (or clusters) of documents, referred as document events in this paper. The problem 
with IR systems is that, they only retrieve subject-documents, but no content of docu¬ 
ments is identified. Users are required to skim through returned documents. Events of 
IE are templates with slots to be filled, referred to as template events. Template event 
recognition extracts structured data from semi-structured or unstructured data. It is ex¬ 
pected that the result is used to populate a knowledge base directly. The main problem 
for template event recognition is that it suffers from poor performance, especially in an 
open field, where heterogeneous resources, noise and fragmental data are processed. 

In open fields, there are knowledge bases automatically constructed to support in¬ 
formation exploring. Many of them are constructed from semi-structured database (e.g. 
Wikipedia, WordNet) or under human supervision (collaboratively). Therefore, a bet¬ 
ter consistency is expected. They are widely used as external knowledge for semi- 
supervised methods. Because semi-structured data or human labor are required, when a 


news is broken in open fields automatically handling this information in real time is dif¬ 
ficult. Furthermore, many systems automatically organize extracted linguistic units into 
a graph-based representation. These systems generally lead to a complex network with 
thousands of nodes or edges. Rare analysis was conducted to show underlying struc¬ 
tures of events. Generally, output generated by IE systems is error-prone, redundant 
and incompatible information makes it contradictory. In this paper, analyses of event 
network are emphasized. Contributions of this paper include, 

1. An event network framework is proposed, which provides an event oriented infor¬ 
mation exploring and supports various analyses. 

2. Two novel analytic methods (PET analysis and action analysis) are discussed for 
event oriented information exploring. 

The rest of this paper is organized as follows. Section |2] introduces related work. 
Motivations and definitions about the event network are presented in Section[2 Section 
|4] discusses our method to construct event networks. Section |5] demonstrates applica¬ 
tions of event network. Section|6]gives conclusions. 


2 Related Work 


Organizing linguistic units as networks or graphs shows an increased interest in NLP. 
These representations can be roughly divided into three paradigms; logic based seman¬ 
tic network (e.g. Ontology), scalable knowledge database (e.g. Ereebase) and semantic 
network constructed from open information extraction. 

Logic based semantic network refers to networks constructed with human labor. 
They mainly focus on a closed domain. Eor example, conceptual graph represents 
logic as a graph representation (ISowal 1198411 . These networks support logic opera¬ 
tors, can map questions and assertions from natural language to a relational database. 
Logic based semantic networks are constructed by domain experts and used as domain- 
specific ontologies (e.g., WordNet, Cyc, etc.). Commonly, they support inferences de¬ 
veloped in knowledge representation. In this paradigm, conflicting and contradictory 
are not allowed. Therefore, in open fields it is difficult to apply. 

In the second paradigm, logic based s emantic network is ex tended into open fields. 
Large knowledge databases such as Yago (ISuchanek et al.Ll2007h . Ereebase (IBollacker et al 
2008h are constructed, representing large knowledge in formalized forms. These repre¬ 
sentations merge diverse and heterogeneous data with high scalability, providing a uni¬ 
fied framework for organizing information. Instead of rigid definitions, these networks 
have no canonical view of data. They use a loose representation to support scalability 
and extensibility. Many of them are constructed by merging ontologies (e.g. WordNet, 
OpenCyc) or extracted from semi-structured database (e.g. Wikipedia). To construct 
logic based semantic network, commonly, direct or indirect human interventions are 
involved (e.g. collaborative methods or searching logs). 

Instead of aiming at semi-structured data, the third paradigm explores information 
in an open and dyn amic field (mainly focusing on un s tructu red data). In this paradigm, 
wea k supervision jMintz e t all W09 Xu and Zhaol 20141) and bootstrapping meth 
ods ( Agichtein and Gravanol ^OOl : iKozareva and Hovvi ^lOt McIntosh et al.L 2011 

























Weld et al.L 120091) are emp loyed. Scalable kn owledge data bases are used to guide th e 


proces s, e.g., TextRunn er ( Banko et al. , 2007 ), KnowItAll ( Etzioni et al.L 2005 , 201 1 ), 
WOE ( Hoffmann et al.L 201ol) and StatSnowBall ( Zhu et al. . 2009h . Commonly, in these 
systems, nodes are named entities and edges are relations between them. All extracted 
results are combined into a large network. It often generate a network containing more 
than thousands of nodes and edges. 

The notion of eyent is widely used for exploring information. jPiskorski et al.l (1201 ih 
presents an on-line news eyent extraction system. Each event is dehned as a frame with 
slots hlled by infor mation extracted from cluste red documents, where the pattern match¬ 
ing method is used. lRamakrishnan et al.l (120141) proposes an EMBERS system encoding 
events as frames. It is used to forecast “ civil unrest” event s in open helds. TwiCal ex¬ 
tracts open-domain events from Twitter (IRitter et al.Ll2012h . where events are identified 
by named entities. In the same data set, ET represe nts eyents as clusters of keywords 
( Parikh and Karlapalemi 2013 ). Kuzey et al. ( 2014 ) uses events itself as nodes of net¬ 
work. They cluster documents into a hierarchical repr esentation, where no tes are events 
and edges link the same event in chronological order. lAngel et al. ( 2012 ) constructs an 
entity network from social media by the streaming edge we ight method. They mine 
dense subgraphs of network for identifying realtime stories. Das Sarma et al.l ( 2011 ) 
provides an event discovery method based on entity dynamic relation graphs, which are 
constructed by co-occurrences of entities constrained in documents. 


3 Motivation and Definition 


Methodologies to organize linguistic units into networks or graphs show an increased 
interest in NLP. They provide novel solutions for many NLP tasks and support human 
oriented information exploring. Representing linguistic units as a graph enables topo¬ 
logical analyses developed in helds such as: social network and complex network. 

In open helds, these representations are mainly constructed by techniques devel¬ 
oped under information extraction or text understanding. Information extraction aims 
at extracting linguistic units with concrete concepts or functions. It is seen as a trade¬ 
off between information retrieval and text understanding, where text understanding tries 
to capture all information in a document . Text understanding may lead to worsen per¬ 
formance caused by applied techniques (IHobbs and Riloffl l2010h . On the other hand, 
information extraction extracts targeted units and ignores uninterested. 

Due to extracting challenges in open helds, instead of extracting information in a 
monolithic process, we divide the task into three steps: document event detection, event 
network construction and event network analysis. In the hrst step, by implementing 
document event detection and tracking, documents are organized into document events. 
Most of irrelevant or uninterested information is hltered. Then, in the second step, IE 
techniques are employed to extract linguistic units and organized into event networks. 
Techniques with higher performance are highlighted. Eor example, instead of extract¬ 
ing named entities as nodes, entity mentions are used. Where coreference resolution are 
required to group entity mentions into named entities, which is error-prone. In the last 
step, because topological information is available, structural information between lin- 



















































guistic units can be used to modify the network quality. Then, network or graph based 
analytic methods can be used, and it is convenient for visualization. 

In this domain, many systems combine extracted results into a complex network, 
which not doing much help for analysing information. Furthermore, redundant and in¬ 
compatible information makes it contradictory and misunderstanding. In our applica¬ 
tion, we emphasize methodologies conducted for event network analysis. Advantages 
of the event network include: first, after document event detection, information extrac¬ 
tion in each document event can be independently implemented. Therefore, the effect 
of noise and heterogeneous data on information extraction can be reduced. Secondly, 
crossing document information enables discovery of potential relations between docu¬ 
ments. Thirdly, event network provides a structured data representation for exploring 
open information, topological methods, e.g., social network or complex network, can 
be introduced for event network analysis. 

For convenience to discuss event network analysis, we define nodes and edges of 
event network as frames with slots. These information is also expected to support human 
oriented information exploring. 

Let ^={di, ■ ■ ■ , di} be a document set, di denotes a document. A document event 
(ak is a subset of For all di, dj G Sk, similarity function Similarity{di, dj) satisfy 
a predefined condition (e.g. a threshold). All document events in ^ are denoted as 
S'={S‘i, ■ ■ ■ , (ox}. The constraint that # is a partition of ^ is not necessary, because 
some documents in ^ can be filtered, or fuzzy partitioning techniques can be used, 
which enable a document belonging to more than one document event. 

An event network on document event is represented as a graph ^k={Vk, Ek}, 
where Vk={vki, • • • , ffcAr} and Ek={eki, • • • , c^m} are vertex set and edge set. Both 
vertices and edges are frames defined as follows. 

vertex := {key, name, type, weight, inf o} 
edge := {type, t;-l, v-2, weight, info} 

where vertex frame defines nodes of event network. Slot “name” refers to entity men¬ 
tions occurred in a document event. Each vertex is identified by an integer value “key”. 
Slot “type” represents categories of vertices (e.g. Person, Organization and Location). 
Slot “weight” is the likelihood of “name” to be “type”. Traditionally, this value is 
given by a classifier when extracting this frame. Depending on real applications, “weight” 
can be used to filter an event network. An edge frame denotes a relation between two 
vertexes. Slots “v-1” and “v-2” are keys of vertices in an edge, used to identify vertices 
linked by edges. Edge types are referred by “type” (e.g. Part-whole, Personal-Social). In 
both frames, slot “info” contains information about the frames, where entity mentions 
or entity relations occurred, e.g., sentences, documents or timestamps. These informa¬ 
tion support event network analyses (e.g. coreference resolution, statistical relational 
learning or manually exploring). If they are empty, these values are null. 


4 Implementation 

This section discusses our method to construct event networks, which are used to show 
methodologies discussed in Section|5] 


4.1 Data Sets 


We use the ACE 2005 Chinese corpus. It contains 633 documents annotated with 15,264 
entities and 33,932 entity mention^. 7 entity types (e.g. person, organization, etc.) and 
44 entity subtypes are defined. The corpus also annotated with 6 major relation types 
and 18 relation subtypes. Each relation instance has two named entities as arguments. 
There are 9,244 relation mentions are collected as positive instances. 

The ACE 2005 Chinese corpus is used to train named entity and relation classifiers. 
In order to show our method in an open field, we also use the Chinese Gigaword Eifth 
Edition corpus. The Peoples Daily source is used, which contain 145,001 newswire 
texts covering the period from November 2006 through December 2010. 


4.2 Events Detection 


The purpose of document event detection is t o clus ter documents into events. We use 
EDA toolkit provided by iPhan and NguvenI OOOTh to implement this task. In EDA 
model, a corpus is first represented as a matrix, where each column refers to a doc¬ 
ument vector, and each row represents distribution of a term in documents. Then EDA 
maps documents from a term space into a topic space. Topics are hidden variables. 

Because we focus on newswi re texts, wh e re sho rt texts are commonly used. We use 
Omni-word feature proposed by Chen et al. ( 2014h . which takes every potential word 
as terms of documents. It is a subset of n-Gram feature. In the pretreatment process, we 
remove high and low frequency wordflin an employed lexicon. Words with frequencies 
lower than 10 are also removed. To train an EDA model, hyper-parameters are required. 
The topic number is set as 25. Other parameters use default settings. 

The toolkit generates several outputs. The word-topic distributions are more favourable 
to us, which give distributions of terms in a topic space. We use topics as centroids of 
document clusters in a term space. When clustering documents, a documents belonging 
to an event is judged by the nearest Euler Distance of the document and centroids. The 
top 100 most likely words per each topic are used to represent an event. 

It is recognized that documents discussing the same event tend to be temporal prox- 
imity, and a time g ap between bursts of similar documents may indicate different events 
( Yang et aU 1999 ). Therefore, timestamps are used to partition the newswire texts. In 
our experiment, the time step is set as 5 months. Then, the Chinese Gigaword corpus is 
divided into 10 parts. Each part contains 5 months newswire texts. Because hierarchi¬ 
cal representation can give a multi-granularity review when exploring open information 
and reduce the travel cost. In each time step, instead of using retrospective methods to 
give a flat partition of documents, we organize them into a hierarchical representation. 
Documents of each time step are clustered into 25 events by the EDA toolkit. Each 
event is further clustered into sub-events by the same approach. If an event contains 
documents less than ten, the process to find its sub-events is skipped. Therefore, in each 
time step, 25 events and at most 25 x 25 sub-events are detected. 


’ An entity mention is a reference to an entity. 
^ The ratio is 5% for each. 














4.3 Named Entity Recognizing 

In this step, it is free to use any named entity recognition methods. In our application, 
we use a Boundary Assembling (BA) method to implement the named entity recogni¬ 
tion task. The notion of BA method is that, instead of recognizing entity mentions in 
a unitary style, it first detects boundaries of entity mention, then assembles detected 
boundaries into entity mention candidates. Each candidate is further assessed by a clas¬ 
sifier. 

In our work, we recognize three types of named entity: “PER” (Person), “LOC” 
(Location) and “ORG” (Organization). In order to filter noise, recognized named enti¬ 
ties with Chinese characters less than two and more than six are discarded. 


4.4 Relation Recognizing 


To rec ognize relations between named entities, we adopt the method proposed in IChen et al. 
( 20141) . where an Omni-word feature and a soft constraint method is proposed for Chi¬ 
nese relation extraction. The Omni-word feature uses every potential word in a relation 
mention as lexical features. Then for each employed atomic feature, an appropriate con¬ 
straint condition is selected to combine them with additional information to maximize 
the classification determination. 

With our employed three entity types, five relation types annotated in the ACE cor¬ 
pus are recognized: “PER-SOC”, “GEN-AEE”, “ORG-AEE”, “PART-WHOLE” and 
“PHYS”. Sentences with more than ten entities are ignored, because extracting rela¬ 
tions in a long sentence is error-prone. 


4.5 Merging and Visualizing 

As approaches discussed above, the result about recognized document events, named 
entities and relations are listed in Table [T] 


Table 1: Information of Results 


Step 

Doc. 

Event 

Entity 

Relation 

Step 

Doc. 

Event 

Entity 

Relation 

0 

14,814 

642 

1,123,506 

309,847 

5 

19,706 

643 

1,389,668 

290,594 

1 

11,734 

642 

897,489 

258,817 

6 

12,014 

645 

927,977 

265,439 

2 

17,678 

644 

1,104,517 

257,291 

7 

9,100 

641 

705,805 

254,133 

3 

18,213 

646 

1,305,191 

277,721 

8 

9,326 

635 

726,123 

230,535 

4 

23,433 

644 

1,704,294 

289,166 

9 

8,983 

640 

674,457 

243,210 


In the f ollows, in order to conduct event network analysis, we use the igraph toolkit 
provide by ICsardi and Nenuszl (120061) . representing extracted enti ty mentions and re¬ 
lations as a graph. A network analysis toolkit (Pajek) provided bv IBatageli and Mrvai 
(Il998l) is used for visualization. 
























5 Application 


In our work, we emphasize analyses of event networks. After event networks were 
constructed, techniques such as social network, complex network can be employed to 
analyse event networks. For example, setting a person name as a central entity, we can 
navigate entities around it. Filtering irrelevant information, we can show character rela¬ 
tionships in an event. Using the “PART-WHOLE” relations, multi-granularity visualiza¬ 
tion can be supported. Because event networks have a graph representation, topological 
information is available. Therefore, various approaches (e.g., statistical relational learn¬ 
ing) can be used to improve the network quality. Furthermore, event networks support 
human oriented information exploring. When human exploring open information, man¬ 
ual interventions can be used to modify the quality of event networks. 

In this section, we choose a document event in time step 0 as an example, which 
contain 1,041 documents. There are 42,436 named entities and 6,272 relations occurred. 
The most likely words in it are etc.” (Assault, NATO, 

Spokesman, Conflict, Defence, etc.). It indicates that the concern of this event is military 
affairs. Extracted named entities and relations are organized in Eigure[Tl where there are 
252 nodes and 571 edges are merged. Nodes in Red, Yellow and Blue colors represent 
Person, Organization and Location respectively. Each edge is labelled by the relation 
type. 



Eig. 1: Event Network 


To explore open information, many systems dynamically organize linguistic units 
into a complex network. Heterogeneous resources and unreliable information make the 
network chaotic and misunderstanding. As Eigure[T]showing, a complex network makes 
it difficult to understand. In the following, based on the event network, we give four 
methodologies to explore open information: Information Eiltering, PET Analysis, Ac¬ 
tion Analysis and Social Network Analysis. 

5.1 Information Filtering 

The simplest way to analyse event network is to Alter information that is irrelevant or 
uninterested. In Pigure [2(a^ only person names and “PER-SOC” relations are remained 
to show character relationships in an event network. 
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(a) Information Filtering 


(b) PLA Analysis 


This example can be formalized as: Let ^ be an event network. The filtered event 
network ^'={V', U'} is a subgraph of such that c/L' C c/L. And satisfies 
Vv € V'(v.type = PER) A Ve G E'{e.type = PER-SOC). 

Using information contained in vertex frames and edge frames, information filter¬ 
ing can provide effective approaches for exploring open information. Eor example, in 
vertex frames and edge frames, we may require that the value in weight slots is greater 
than a predefined threshold. Utilizing information in info slots, we can collect named 
entities occurred in specified periods or areas. Eor a central figure, we can see directly 
connected named entities and relations between them. 


5.2 PUT Analysis 

Person-Location-Time (PLT) analysis tries to find relations between persons and loca¬ 
tions in a period of time. It can be used to track a person, find trajectories of targeted 
entities. Person, Location are extracted by named entity recognition methods. While 
the Time is different. Two kinds of Time are distinguished in a document: implicit 
temporal information and explicit temporal information. Implicit temporal information 
is part of the document’s content indicating the creation, development, termination of 
an event. It is also seen as a named entity type in some researches. Extracting this in¬ 
formation needs information extraction or text understanding techniques. In many ap¬ 
plications, it is ignored. Generally, in open fields, all documents have explicit temporal 
information, which includes the creation, modification and transmission timestamps of 
documents. They are meta-data spread with documents. In our paper, because we focus 
on newswire texts, where the explicit temporal information of documents is released 
together. Therefore, we use the explicit temporal information for PLT analysis. 

This process can be formalized by introducing an attribute time in the info slot. 
Let aL be an event network, times represent timestamps and person is a person name. 

, E'} is the result of PLT analysis based on aL, where Ve G E'{e.type = 
PHYS A (e.u-1 = person V e.v-2 = person)). In other words, all relation type in 
E' is “PHYS”, and take the same entity mention person as an argument. Replacing all 
person by corresponding times, we get a graph with nodes referred to timestamps and 
locations. An example is shown in Pigure[2(b)| 







In this example, we track Mao Zedong (“^'?$]^”ilin the whole Gigaword corpus, 
collect all recognized “PHYS” relation instances which have Mao Zedong as an argu¬ 
ment. In the result, there are 142 “PHYS” relation mentions, which take Mao Zedong 
(or Chairman Mao) as arguments. Then we replace Mao Zedong (or Chairman Mao) 
by the explicit temporal information of news wire texts. In Figure [2(bffl nodes in green 
color are timestamps, and blue nodes are locations. Each green node means that Mao 
Zedong occurred with the connected locations at that time. 


5.3 Action Analysis 

Recognizing an “event” under the ACE dehnition is difficult, where event triggers, par - 


ticipant roles, properties and attributes sho uld be identihed (iDoddington et al.L 120041) . 


It received an ACE value score only 30% in Ahn ( 2006h . In an open held, it will come 
to worse performance. In many researches, co-occurrence information (e.g. co-citation, 
co-word, co-link, etc.) between ter ms is used to explore and underst and structures in 
the underlying document sets, e.g., Levdesdorff and VaughanI ( 2006 ). In our applica¬ 
tion, instead of the dehnition in ACE, we present the action analysis. 

In action analysis, we focus on detecting whether or not a special action is men¬ 
tioned in a sentence. Therefore, we conduct the “sentence classihcation” task, class¬ 
ing each sentence by a classiher trained on the ACE annotated event mentions. In our 
application, we monit or the “Conhict” ACE ev ent type, which has two substype: At¬ 
tack and Demonstrate (IDoddington et al.Ll2004l) . The ACE corpus, which annotates 596 
“Conhict” events, is employed for training and testing. We implement the 5-fold cross 
validation, and the P/R/E (Precision/Recall/E-score) measurement. E-score is computed 
by (2 X P X R)/{P -f R). In order to perform a two-class classihcation, we gener¬ 
ate negative instances by segmenting the corpus into sentences, discarding annotated 
ACE event mentions, and hltering sentences without event triggers of “Conhict” ACE 
events. Then, 1,589 sentences are collected as negative instances. We only use Omni¬ 
words features in sentences for classihcation. The performance is shown in Row 1 of 
Table 12] where Only the performance about “Conhict” is listed. 


Table 2: Performance of Action Analysis 


No. 

Precision 

Recall 

F-core 

1 

82.19 

82.88 

82.53 

2 

97.65 

41.94 

58.68 


In an open held with massive data, the precision is more emphasized. Therefore, we 
label an instance as a “Conhict” action only when the employed classiher (maximum 
entropy) output a predicted value equals 1 q The performance is shown in Row 2 of 

^ The leader of the Communist Party of Chinese. 

Because the original graph is more complex (55 nodes and 142 edges), in this place, only part 
of it is given. 

^ The default value is 0.5 in two-class classihcation. 
























Table 12 We use this setting to train a classifier and predict every sentence in document 
events. Entity co-occurrences in each “Conflict” sentence are calculated. The result is 
shown in Figure [2(^ 



(c) Action Analysis 


(d) Social Network Analysis 


Figure [2(^ shows the result about the employed event. Edges in this example indi¬ 
cate co-occurrence relations between entities. In this event, there are 12,076 sentences 
containing at least two entities,where 836 sentences have the “Conflict” action with 
value 1 outputted by the classifier. Among them, 3,221 entities co-occurred. In order to 
make the result more comprehensible, edges with co-occurrence frequencies less than 
12 are erased. Finally, a network with 25 entities is generated. In this example, entities 
(e.g., “Uh^Dt” (Hamas), (the Gaza Strip), (Afghanistan), 

¥” (U.S. forces)) and the edges between them surely show meaningful information. 

5.4 Social Network Analysis 

Techniques (e.g.. Short Path, Cohesive Subgroup, Center, etc.) proposed in social net¬ 
work mainly implemented on a network constructed by domain experts. A precise net¬ 
work is required to discover the underlying structure of social network. Because event 
networks are automatically extracted. They are error-prone. Therefore, for some of these 
techniques, it is difficult to get a reliable output. However, some results generated by 
social network also show meaningful information for us. In Figure [2(d)l an example is 
given. 

Data in this example comes from results of Information Filtering and PLT analysis. 
The left of Figure [2(d)| seeks a short path between (Hamid Karzai) and 

“|S|^@P” (the Secretary of State). They are connected by “PER-SOC” relations. On 
the right, “Mao Zedong” is set as the central figure to show directly collected locations, 
e.g., “#|^lL|” (Jinggangshan). 

6 Conclusion and Future Work 

Event network is a framework for exploring open information. In this paper, based on 
the employed data set, we show applications of event network for information analyses. 
In future work, based on event network, more analyses can be developed to support 
exploring open information. 
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